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NM_000312 1843 bp mRNA PRI 31-OCT-2000 

Homo sapiens protein C (inactivator of coagulation factors Va and 
Villa) (PROC), mRNA. 
NMJD00312 

NM_000312.1 GI:4506114 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1843) 
Foster D and Davie EW. 

Characterization of a cDNA coding for human protein C 
Proc. Natl. Acad. Sci. U.S.A. 81 (15), 4766-4770 (1984) 

84272714 
6589623 

2 (bases 1 to 1843)' 

Foster DC, Yoshitake S and Davie EW. 

The nucleotide sequence of the gene for human protein C 
Proc. Natl. Acad. Sci. U.S.A. 82 (14), 4673-4677 (1985) 

85270390 
2991887 

3 (bases 1 to 1843) 

Beckmann, R. J. , Schmidt , R. J . , Santerre, R. F. , Plutzky,J., 
Crabtree, G . R. and Long,G.L. 

The structure and evolution of a 4 61 amino acid human protein C 
precursor and its messenger RNA, based upon the DNA sequence of 
cloned human liver cDNAs 
Nucleic Acids Res. 13 (14), 

85269639 

4 (bases 1 to 1843) 



5233-5247 (1985) 



Patracchini P, Aiello V, Palazzi P, Calzolari E and Bernardi F. 

Sublocalization of the human protein C gene on chromosome 2ql3-ql4 

Hum. Genet. 81 (2), 191-192 (1989) 

89108444 

2912888 

PROVISIONAL REFSEQ: This record has not yet been subject to final 
NCBI review. The reference sequence was derived from X02750 . 1 . 
Location/Qualifiers 
1. .1843 

/organism="Homo sapiens" 
/db xref="taxon: 9606" 
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/chromosome-" 2" 
/map="2ql3-ql4" 
gene 1..1843 

/gene="PROC" 
/db_xref ="Locus ID : 5624 " 
/db xref="MIM: 176860 " 
CDS 98. . 1483 

/gene="PROC" 
/EC_number="3.4 .21. 69" 
/codon_start=l 
/db_xref="LocusID:5624" 
/db_xref = M MIM : 176860 " 

/product="protein C (inactivator of coagulation factors Va 
' and Villa) " ' 
' /protein id^" NP 000303.1 " 
/db_xref="GI : 4506115" 

/translation="MWQLTSLLLFVATWGISGTPAPLDSVFSSSERAHQVLRIRKRAN 
SFLEELRHSSLERECIEEICDFEEAKEIFQNVDDTLAFWSKHVDGDQCLVLPLEHPCA 
SLCCGHGTCIDGIGSFSCDCRSGWEGRFCQREVSFLNCSLDNGGCTHYCLEEVGWRRC 
SCAPGYKLGDDLLQCHPAVKFPCGRPWKRMEKKRSHLKRDTEDQEDQVDPRLIDGKMT 
RRGDSPWQVVLLDSKKKLACGAVLIHPSWVLTAAHCMDESKKLLVRLGEYDLRRWEKW 
ELDLDIKEVFVHPNYSKSTTDNDIALLHLAQPATLSQTIVPICLPDSGLAERELNQAG 
QETLVTGWGYHSSREKEAKRNRTFVLNFIKIPWPHNECSEVMSNMVSENMLCAGILG 
DRQDACEGDSGGPMVASFHGTWFLVGLVSWGEGCGLLHNYGVYTKVSRYLDWIHGHIR 
DKEAPQKSWAP " 
siq peptide 98 . .196 

167. .358 

/note="GLA; Region: Domain containing Gla 
( gamma-carboxyglut amate ) residues " 
197. .223 

/note="propeptide (aa -9 to -1)" 
224. .358 

/note- "gamma carboxylation domain (aa 1-4 5) " 
224 . . 688 

/note="light chain (aa 1-155)" 
272. .361 

/note="gla; Region: Vitamin K-dependent 
carboxylat ion/ gamma-carboxyglut amic (GLA) domain" 
359. .496 

/note="EGF-domain I (aa 46-91)" 
359. .493 

/ note-"EGF_CA; Region: Calcium-binding EGF-like domain" 
497. . 634 

/note="EGF-domain II (aa 92-137)" 
653. .730 

/note="activation peptide region (aa 144-169)" 
695. .1531 

/note="heavy chain (aa 57-419)" 
728. .1432 

/note="Tryp_SPc; Region: Trypsin-like serine protease" 
731. .1531 

/note="serine protease region (aa 170-419)" 
734... 1432 

/note="trypsin; Region: Trypsin" 
1759. .1764 
• . /note="polyA signal" 

BASE COUNT ,417 a 530 c 564 g 332 t 

ORIGIN 

1 ctgcaggggg gggggggggg gggggctgtc atggcggcag gacggcgaac ttgcagtatc 
61 tccacgaccc gcccctacag gtgccagtgc ctccagaatg tggcagctca caagcctcct 
121 gctgttcgtg gccacctggg gaatttccgg cacaccagct cctcttgact cagtgttctc 
181 cagcagcgag cgtgcccacc aggtgctgcg gatccgcaaa cgtgccaact ccttcctgga 
241 ggagctccgt cacagcagcc tggagcggga gtgcatagag gagatctgtg acttcgagga 
301 ggccaaggaa attttccaaa atgtggatga cacactggcc ttctggtcca agcacgtcga 
361 cggtgaccag tgcttggtct tgcccttgga gcacccgtgc gccagcctgt gctgcgggca 
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cggcacgtgc 
ccgcttctgc 
gcattactgc 
gggggacgac 
gcggatggag, 
agatccgcgg 
cctgctggac 
gctgacagcg 
tgacctgcgg 
ccccaactac 
cgccaccctc 
cgagctcaat 
agagaaggag 
cccgcacaat 
gggcatcctc 
ctccttccac 
ccttcacaac 
catcagagac 
ctgggctttt 
tgctgttctg 
gcacctgttg 
gggtggggag 
actctgttta 
aaaaaaaaaa 
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atcgacggca 
cagcgcgagg 
ctagaggagg 
ctcctgcagt 
aagaagcgca 
ctcattgatg 
tcaaagaaga 
gcccactgca 
cgctgggaga 
agcaagagca 
tcgcagacca 
caggccggcc 
gccaagagaa 
gagtgcagcg 
ggggaccggc 
ggcacctggt 
tacggcgttt 
aaggaagccc 
gcatggcaat 
tccttccatc 
tatgtcacat 
gagcagatcc 
tgaaaaagaa 
aaaaaaaccc 



// 



tcggcagctt 
tgagcttcct 
tgggctggcg 
gtcaccccgc 
gtcacctgaa 
ggaagatgac 
agctggcctg 
tggatgagtc 
agtgggagct 
ccaccgacaa 
tagtgcccat 
aggagaccct 
accgcacctt 
aggtcatgag 
aggatgcctg 
tcctggtggg 
acaccaaagt 
cccagaagag 
ggatgggaca 
cctcttttgg 
gccttatgaa 
aagttttgcg 
taaaaaacac 
ccccccgccc 



cagctgcgac 
caattgctcg 
gcgctgtagc 
agtgaagttc 
acgagacaca 
caggcgggga 
cggggcagtg 
caagaagctc 
ggacctggac 
tgacatcgca 
ctgcctcccg 
cgtgacgggc 
cgtcctcaac 
caacatggtg 
cgagggcgac 
cctggtgagc 
cagccgctac 
ctgggcacct 
ttaaagggac 
gctcttctgg 
tagaatctta 
gggtctaaag 
aaccacgaaa 
cccccccctg 



tgccgcagcg gctgggaggg 
ctggacaacg gcggctgcac 
tgtgcgcctg gctacaagct 
ccttgtggga ggccctggaa 
gaagaccaag aagaccaagt 
gacagcccct ggcaggtggt 
ctcatccacc cctcctgggt 
cttgtcaggc ttggagagta 
atcaaggagg tcttcgtcca 
ctgctgcacc tggcccagcc 
gacagcggcc ttgcagagcg 
tggggctacc acagcagccg 
ttcatcaaga ttcccgtggt 
tctgagaaca tgctgtgtgc 
agtggggggc ccatggtcgc 
tggggtgagg gctgtgggct 
ctcgactgga tccatgggca 



tagcgaccct 
atgtaacaag 
agggaagtaa 
actcctagag 
ctgtgtgtgt 



ccctgcaggg 
cacaccggcc 
catttactga 
caactctgtg 
tgagggggat 



aaaaaaaaaa aaaaaaaaaa 
cag 
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